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Production of recombinant proteins for use as pharmaceuticals, 
so-called biopharmaceuticals, is a multi-billion dollar industry. 
Many different cell factories are used for the production of 
biopharmaceuticals, but the yeast Saccharomyces cerevisiae is 
an important cell factory as it is used for production of several 
large volume products. Insulin and insulin analogs are by far 
the dominating biopharmaceuticals produced by yeast, and 
this will increase as the global insulin market is expected to 
grow from USD12Bin2011 to more than USD32B by 2018. Other 
important biopharmaceuticals produced by yeast are human 
serum albumin, hepatitis vaccines and virus like particles used 
for vaccination against human papillomavirus. Here is given a 
brief overview of biopharmaceutical production by yeast and 
it is discussed how the secretory pathway can be engineered 
to ensure more efficient protein production. The involvement 
of directed metabolic engineering through the integration 
of tools from genetic engineering, systems biology and 
mathematical modeling, is also discussed. 



The introduction of genetic engineering by Cohen and Boyer in 
1973 1 laid the fundament for the current biotech industry, which 
is based on using microorganisms or cell cultures for production 
of proteins that can serve as pharmaceuticals, often referred to as 
biopharmaceuticals. 2,3 A few years later, researchers at Genentech 
cloned the genes for human insulin and growth hormone, and 
expressed them in Escherichia coli, 4 hereby demonstrating the 
utility and applicability of genetic engineering in creating geneti- 
cally engineered bacteria that produce these two human proteins. 
In 1982 this led to marketing of the first biopharmaceutical, 
human insulin, by Eli Lilly, who licensed the technology from 
Genentech. In 1985 Genentech received FDA approval to mar- 
ket their own first product, Protropin®, the human growth hor- 
mone to be used for children with growth hormone deficiency. 
In 1987 this was followed by the tissue-plasminogen activator 
(t-PA, Activase®), another Genentech product, an enzyme that 
can resolve blood clots in patients with acute myocardial infarc- 
tion. Also in 1987 Novo (now Novo Nordisk), a major insulin 
producing company, launched human insulin produced by the 
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yeast Saccharomyces cerevisiae as a replacement for their human 
insulin enzymaticaiiy derived from porcine insulin. Shortly 
following these early developments many other products were 
launched and today there are more than 300 biopharmaceuti- 
cal proteins and antibodies on the market with sales exceed- 
ing USD100B, 5,6 with monoclonal antibodies representing the 
majority (> USD18B) followed by hormones (> USD11B) and 
growth factors (> USD10B). 7 Furthermore, biopharmaceuticals 
have the fastest growth in the marked with an annual growth of 
about 19%, 8 and there are currently more than 240 monoclonal 
antibodies and 120 recombinant proteins in clinical trials. 9 

About 40% of the biopharmaceuticals are currently being pro- 
duced by mammalian cell cultures, mainly using Chinese Hamster 
Ovarian cell lines (CHO cells), as these allow for production of 
proteins with very similar glycosylation patterns as human pro- 
teins. 10,11 E. coli is used as cell factory for production of another 
30% of the biopharmaceuticals whereas 20% are being produced 
by S. cerevisiae. 10,11 The dominant biopharmaceuticals produced by 
S. cerevisiae are insulin (and insulin analogs), human serum albu- 
min, hepatitis vaccines and virus like particles, e.g., for vaccination 
against human papillomavirus. The advantages of using yeast S. 
cerevisiae as a cell factory for the production of biopharmaceuticals 
are that this eukaryal model system enables production and proper 
folding of many human proteins. Furthermore, the proteins can 
be secreted to the extracellular medium and this facilitates subse- 
quent purification. A further advantage is that in many cases yeast 
can perform proper post-translational modifications of the protein, 
including proteolytic processing of signal peptides, disulfide bond 
formation, subunit assembly, acylation and glycosylation. 12 S. cere- 
visiae is also widely used as an eukaryal model organism 13,14 and 
there is therefore much information available about this organism 
through high-throughput studies, 15 databases, sequenced genomes 
and extensive toolbox for molecular modification, which provides 
an extensive knowledge base for further engineering of this organ- 
ism. One of the limitations with the use of yeast is, however, that 
it performs high-mannose type N-glycosylation. This confers 
a short half-life of the modified protein in vivo, which then can 
have a reduced efficacy for therapeutic use. 16 Much work has been 
performed on engineering yeasts, both S. cerevisiae and Pichia 
pastoris, so that they can carry out human-like N-glycosylation 
patterns that even includes terminal addition of sialic acid to the 
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glycoprotein. 16 " 18 This has opened for an even wider use of S. cere- 
visiae as a cell factory for production of biopharmaceuticals and 
there is therefore much interest in further engineering of yeast for 
ensuring efficient production of recombinant proteins. 

More than 40 different recombinant proteins have been 
expressed, produced and secreted by S. cerevisiae} 2 This includes 
several biopharmaceuticals and Table 1 provides an overview of 
some of these products, i.e., the protein name, their therapeutic 
application, leader sequence used and the titer reported in the 
publically available literature. As the table illustrates there are 
basically used three different types of leader sequences to ensure 
efficient secretion of the protein through the secretory pathway. 
S. cerevisiae only secretes few proteins to the extracellular medium, 
with the a-factor (a yeast hormone involved in mating) being 
the most studied and therefore most frequently used for efficient 
secretion of recombinant proteins. To further improve protein 
secretion in yeast Kjeldsen and coworkers at Novo Nordisk devel- 
oped a synthetic leader that has been shown to be very efficient 
in protein secretion. 12,15 As illustrated in Table 1, yeast can, how- 
ever, also secrete human proteins that are expressed with their 
native leader sequences. 

The secretory pathway in yeast is quite complex (see Fig. 1 for 
a schematic overview) as it involves more than 160 proteins that 
are responsible for different post-translational processes, e.g., fold- 
ing and glycosylation. The secretory pathway handles more than 
550 proteins that have a signal peptide in the yeast proteome, but 
only very few of these proteins are secreted to the extracellular 
matrix as all proteins targeted to the endoplasmic reticulum (ER), 
Golgi, vacuole and cytoplasmic membrane are also processed 
through the secretory pathway. Protein folding in the ER is of 
key importance for the secretory pathway as the accumulation of 
mis-folded proteins results in ER stress that is handled then by the 
unfolded protein response (UPR). Activation of the UPR results 
in transcriptional change of about 400 genes, 20 many of which 
are under regulation of the Haclp transcription factor. 21 A result 
of this regulation is upregulation of chaperones and foldases as 
well as ER associated degradation (ERAD). Based on studies of 
the UPR many targets for improving protein secretion have been 
identified and implemented, 12 and it is generally believed that any 
factor that reduces ER stress and its downstream damage caused 
by heterologous protein production, results in improved secretion 
of the produced protein. 

Due to the complexity of the protein secretion pathway there 
has traditionally been focus on transcriptional regulation of pro- 
tein production. A large number of different promoters have 
been evaluated for driving expression of the heterologous genes, 



and among the most widely used promoters are strong glycolytic 
promoters like pTDH3, pPGKl or pTPIl, pADHl, or galactose 
induced promoters like pGALl, pGAL7 or pGALlO. 12 However, 
even though changing expression may improve production of one 
protein, it does not imply there would be improvement in produc- 
tion of another protein, as clearly demonstrated in our recent study 
on production of insulin precursor and a-amylase. 22 In this study 
we used an expression system originally developed by Novo for 
their insulin production. This expression system involves deletion 
of the TPI1 gene in the chromosome and use of the corresponding 
gene, POT1, from Schizosaccharomyces pombe as a plasmid marker 
(see Fig. 2) . This gives a very stable construct as all cells losing the 
plasmid will be deficient in triose-phosphate isomerase activity, a 
key glycolytic enzyme. A particular strength of this method is that 
it is stable also with use of complex media containing amino acids 
and nucleotides, which is not the case with standard yeast auxot- 
rophy markers. Using this vector system we evaluated two differ- 
ent promoters (pTPIl and pTEFP) as well as two different leader 
sequences (synthetic leader and the a-factor leader) for production 
of insulin precursor and a-amylase, and found that there was con- 
siderable differences in production of the two proteins with the 
different expression systems evaluated, i.e., with low gene expres- 
sion a-amylase was produced at higher levels, whereas for high 
gene expression system insulin precursor was produced at higher 
levels. 22 This pointed to very different protein processing in the 
ER, i.e., the larger and more complex a-amylase is more challeng- 
ing for ER-processing when the flux is high. This hypothesis was 
confirmed by a measured increase in ER stress (by genome-wide 
transcription analysis) in the a-amylase producing strains. Based 
on this analysis it was hypothesized that engineering of the down- 
stream secretion pathway may be able to improve the secretion 
of amylase, and indeed overexpressing regulators of the so-called 
SNARE complex, Seclp and Slylp, resulted in improved protein 
secretion. 23 Moreover, it was found that overexpression of SEC1, 
that is involved in regulating vesicle trafficking from Golgi to the 
cell membrane, resulted in improved production of both insulin 
precursor and a-amylase, whereas overexpression of SLY1, that is 
involved in regulating the vesicle fusion from ER to Golgi, increased 
only the a-amylase production. 23 Through combined overexpres- 
sion of the Seclp and Slylp the overall secretion of a-amylase could 
be improved by about 70%, whereas insulin precursor production 
was increased by about 30%, and the study therefore clearly dem- 
onstrates that engineering of the secretory pathway can result in 
significant improvement of recombinant protein production. 

There are many other studies that clearly demonstrate that 
engineering of the secretory pathway can result in improved 



Table 1. Overview of some biopharmaceuticals produced by S. cerevisiae" 



Type 


Protein 


Therapeutic application 


Leader sequence 


Titer 


Blood related 


Human Serum Albumin 


Surgery (plasma expander) 


Native 


3g/L 




Hirudin 


Blood coagulation disorders 


a-Factor 


460 mg/L 




Human transferrin 


Anemia 


Native 


1.8 g/L 


Hormones 


Insulin Precursor 


Diabetes 


Synthetic 


80 mg/L 




Glucagon 


Diabetes 


a-Factor 


17.5 mg/L 


Antigen 


Hepatitis surface antigen 


Hepatitis vaccination 


Native 


19.4 mg/L 
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Figure 1. Schematic overview of the secretory pathway in yeast. Proteins targeted for secretion enter the endoplasmic reticulum (ER). If they fold 
correctly they can enter the secretory pathway, whereas misfolded protein cause ER stress leading to the activation of the unfolded protein response 
(UPR) that results in activation of a very large number of cellular processes, including activation of chaperones and foldases (like BIP and PDI) that 
assist with refolding. UPR is also upregulating ER-associated degradation (ERAD) where the unfolded proteins are exported from the ER, ubiquitinated 
and hereby targeted for degradation by the proteasome (ubiquitin-proteasome system, UPS). Correctly folded proteins can be exported to the Golgi 
for further processing (including additional glycosylation). The COPI- and COPII-complexes facilitate the ER-Golgi transfer, and from the Golgi the 
protein may be secreted via the endosome or be targeted to the vacuole for storage and/or degradation. Different colors represent different types of 
vesicular compartments of the secretory pathway. 



protein production, 12 but a general finding is that most targets 
are quite specific, depending which particular protein is being 
overexpressed. Ideally, there would be one (or few) efficient yeast 
platform strain(s) that can serve as production host for a wide 
range of different biopharmaceuticals. There is therefore a need 
for a more rational approach to engineering yeast for improved 
protein secretion. In such an approach detailed models of the 
secretory pathway can be used for design, similarly as it has been 
done with engineering of metabolic pathways, where the use of 
genome-scale metabolic models has been shown to be of great 
importance. 15 ' 24 ' 25 This approach is referred to as metabolic engi- 
neering, and the workflow, often called the metabolic engineer- 
ing cycle, 26 is illustrated in Figure 3. 

As illustrated, the workflow involves detailed modeling, often 
based on detailed analysis of the cellular metabolism and physiol- 
ogy using high-throughput experimental techniques developed in 
the field of genomics, and the concept of quantitatively describing 
cellular processes with mathematical models is at the core of sys- 
tems biology. 27 The metabolic engineering cycle is therefore very 
similar to the workflow of many systems biology studies where 
perturbation of the cellular system is performed using genetic 
engineering, e.g., by overexpression or deletion of specific genes, 
followed by detailed analyses that can be used to define a math- 
ematical model of the biological system. However, there is a major 
difference in the sense that systems biology is a fundamental sci- 
ence where the primary objective is to gain novel insights, whereas 
metabolic engineering is an applied science with the primary 
objective to obtain an improved cell factory. Clearly the process 




of metabolic engineering also results in improved insight of the 
cellular metabolism and physiology, but generally to a less extend 



Figure 2. Illustration of the stable expression system with a glycolytic 
gene as the selection marker. One of the glycolytic enzymes is used as 
a marker for plasmid presence: the endogenous gene encoding trios- 
ephosphate isomerase (TP/7) is deleted and the corresponding gene 
{POT1) from Schizosaccharomyces pombe is expressed from a plasmid. 
The same plasmid carries the gene for the heterologous gene to be 
expressed (here demonstrated with a gene encoding human insulin). 
If the plasmid is lost the cells lack a key glycolytic enzyme and the 
glycolytic flux is therefore reduced dramatically resulting in impaired 
growth. Cells that are replicating the plasmid in high copy numbers 
and expressing the genes from the plasmid therefore have an inherent 
growth advantage. 
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Figure 3. Schematic overview of the metabolic engineering cycle where systems biol- 
ogy tools are implemented for design of improved cell factories. Through advanced 
modeling novel targets for genetic engineering can be identified, e.g., to ensure 
improved protein secretion. These targets are evaluated by characterizing the strains in 
bioreactors where rates of biomass growth, sugar consumption, and product formation 
are quantified. Fermentation analysis may be combined with high-throughput analyses, 
or omics analyses, where the transcriptome, proteome, metabolome and fluxome are 
measured. Omics analyses may provide new insights into the cellular metabolism and 
physiology, and this may be used to improve the models, that can hence be used for 
further design. The metabolic engineering cycle is very similar to the workflow of many 
systems biology studies where perturbation of the cellular system is performed using 
genetic engineering, e.g., by overexpression or deletion of specific genes, followed by 
detailed analysis that can be used to define a mathematical model for the biological 
system. 



than in a systems biology study due to the differ- 
ences in study design resulting from the different 
objectives. A major limitation in using the rational 
Metabolic Engineering approach, with model based 
design, for improving protein secretion, is the lack 
of detailed models of the protein secretion path- 
way. Even though there have been described several 
kinetic models, 28,29 these describe only part of the 
pathway. There is therefore an obvious need for a 
genome-scale model for protein secretion, in anal- 
ogy to what has been done for metabolism. 30 Such 
detailed models will not only enable rational design 
of improved protein secretion routes they will also 
enable improved integrative analysis of this com- 
plex pathway using previously developed tools (for 
studying metabolism 31 ) and hereby lead to more 
insights into how the secretory pathway operates as 
a system. 
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